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MASSACHUSETTS 


ABSTRACT 


The  problem  of  designing  a  stationary  Gaussian  noise  pro¬ 
cess  of  fixed  variance  so  as  to  optimally  mask  the  possible 
presence  of  a  given  additive  stationary  Gaussian  signal  process 
is  considered.  A  sub-optimal  solution  is  obtained  by  minimizing 
the  divergence  distance  between  the  noise  and  signal-plus-noise 
processes.  Recursive  time  and  frequency  domain  expressions  for 
the  divergence  are  derived  in  terms  of  successive  auto-regressive 
approximations  of  the  processes.  For  short  observation  times, 
the  minimal  divergence  masking  problem  may  then  be  solved  by 
the  unconstrained  minimization  of  a  convex  -  and  recursively 
computable  -  function  in  the  time  domain.  For  long  observation 
times,  the  problem  reduces  to  that  of  minimizing  the  asymptotic 
divergence  rate.  This  problem  may  be  solved  in  the  frequency 
domain  by  straight-forward  algebraic  techniques.  A  number  of 
examples  are  given  which  illustrate  the  methodology. 
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I.  Introduction 


In  this  report  we  consider  some  novel  aspects  of  the 
problem  of  detecting  a  Gaussian  process  in  additive  Gaussian 
noise.  Let  the  integer  N>1  be  given  and  let  a  discrete 
stationary  Gaussian  process  s(t)  (the  signal  process)  be 
specified  by  its  mean  and  covariance  function  on  the  interval 
t  =  1,  2,  .  .  ,  N.  We  are  to  determine  —  under  a  mean  square 
energy  constraint  —  the  statistics  of  the  Gaussian  noise 
process  m(t)  (the  masker  process)  on  the  interval  1,  2,  .  .  ,  N 
which  maximizes  an  observers  probability  of  error  in  the 
binary  hypotheses  test:  masker  process  vs.  masker  process 
plus  signal  process.  We  refer  to  this  problem  as  the  Gaussian 
masking  problem.  The  solution  to  this  problem  is  of  practical 
interest  for  the  efficient  design  of  jamming  signals.  It 
also  enables  a  worst  case  analysis  of  detection  performance 
for  Gaussian  signals  where  the  noise  spectrum  is  not  specified 
in  advance. 

The  explicit  maximization  of  the  probability  of  error 
expression  for  this  problem  is  computationally  burdensome, 
even  for  small  N.  As  an  alternative,  we  obtain  the  masker 
process  which  minimizes  the  divergence  distance  measure 
[l,2]  between  the  masker  and  masker  plus  signal  processes. 
While  this  minimal  divergence  masker  (MDM)  solution  is  in 


general  sub-optimal/  we  emphasize  it  here  for  the  following 
two  reasons: 


(1)  it  offers  the  advantage  of  analytical  and 
computational  simplicity 

(2)  it  is  shown  in  [3]  that  for  large  N  and  small 
signal/masker  variance  ratios,  the  probability 
of  error,  Pe>  is  given  asymptotically  by 


1  r™  2 

=  nfz  f  exp  (-X  /2)  dx 
W  Jyj3/2 


where  J  is  the  divergence  between  the  m(t)  and 
m(t)+s(t)  processes.  Thus,  the  MDM  solution 
corresponds  to  the  optimal  solution  for  this  case. 


Section  1 1  gives  some  definitions  necessary  for  the 
succeeding  development.  In  Section  III,  recursive  time  and 
frequency  domain  expressions  for  the  divergence  between 
arbitrary  multivariate  stationary  Gaussian  time  series  are 
given  (Theorem  1) .  These  expressions  are  in  terms  of  the 
successive  auto- regressive  (AR)  approximations  (time  domain) 
and  corresponding  AR  spectral  approximations  (frequency  domain) 
of  the  processes.  Although  our  primary  motivation  for  deriving 
these  expressions  is  their  applicability  to  the  MDM  solution 
of  the  masking  problem,  these  expressions  have  intrinsic  interest. 
The  time  domain  expression  is  related  to  Akaike's  information 


criterion  [4]-  It  is  particularly  convenient  computationally 
since  it  may  be  evaluated  efficiently  using  the  multivariate 
Levinson  algorithm  [5].  The  frequency  domain  expression 
is  a  generalization  of  expressions  for  the  asymptotic  diver¬ 
gence  rate  given  in  References  6-9 .  For  completeness  a  new 
and  simpler  derivation  of  the  latter  expression  based  on  the 
convergence  properties  of  the  AR  spectral  approximation  is 
given  in  Theorem. 2. 

In  Section  IV,  we  apply  the  above  representations  to 
the  MDM  solution  of  the  Gaussian  masking  problem.  It  is  shown 
in  Theorem  3  that  for  finite  sample  sizes  (N<oo)  this  problem 
may  be  solved  in  the  time  domain  by  the  unconstrained  mini¬ 
mization  of  a  function  of  N-l  variables  which  has  a  unique 
local  minimum  (which  is  its  global  minimum) .  The  MDM  solution 
is  then  illustrated  graphically  for  a  number  of  numerical 
examples. 

The  asymptotic  case  (N-*oo)  is  analyzed  in  Section  V 
where  it  is  shown  that  the  MDM  solution  corresponds  to  the 
minimization  of  the  asymptotic  divergence  rate  between  m(t) 
and  m(t)+s(t).  The  unique  solution  to  this  problem  is  obtained 
in  the  frequency  domain  by  straight-forward  variational  tech¬ 
niques  (Theorem  4).  It  is  also  shown  (Theorem  5)  that  the  mean 
and  discrete  spectral  component  of  s(t)  is  irrelevant  to  the 
solution  of  the  asymptotic  masking  problem. 
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II. 


Preliminary  Definitions 


Let  X^and  be  two  real  random  vectors  with  mutually 
absolutely  continuous  probability  measures  and  respective 
density  functions  f ^ (X)  and  f2(X)*  Let  2 ^  = 
log[f ^ (X) /f 2 (X)]  denote  the  log-likelihood  function.  The 
directed  divergences,  1(1,2)  and  1(2,1)  and  the  divergence, 
J(l,2)  are  then  defined  by 


1(1,2) 

■  Ell>l,2] 

-/'•  'J  f  x  A1  ,  (X)  dx 

( 1.  a) 

1(2,1) 

"  E2l>2,l] 

(l.b) 

J  ( 1 , 2) 

=  1(1,2)  +  1(2,1) 

(2) 

where  E^t*]  denotes  the  expectation  under  hypotheses  k. 

These  measures  of  statistical  separability  were  first 
introduced  by  Jeffreys  and  have  found  wide  applicability 

to  problems  of  feature  extraction  [1]  and  optimal  signal 
design  [2].  1(1  ,2)  is  also  referred  to  as  the  relative  entropy 

or  Kullback-Liebler  Number  of  X^  with  respect  to  X2  antJ  may 
be  interpreted  as  the  mean  information  gained  from  the  obser¬ 
vation  X.^  for  discrimination  in  favor  of  hypothesis  1. 


For  the  case  where  the  observations  are  Gaussian  with 
respective  mean  vectors  and  covariance  matrices  Z_k,  1(1,2) 
is  given  by  ([l],  p.  189). 

+  h  tr  Z L  (z'1  -  Hi1) 

+  %  tr  Z"1  (j^-m^)  (P1-y2)T.  (3) 

For  the  case  of  interest  here  where  the  observations 
consist  of  N  samples  from  one  of  two  possible  stationary 
Gaussian  processes,  recursive  expressions  are  derived  for 
1(1,2)  which  are  more  computationally  efficient  than  (3)  and 
are  of  particular  value  in  the  context  of  the  masking  problem. 
These  expressions  involve  AR  approximations  to  the  processes. 
Definitions  of  the  requisite  quantities  are  briefly  summarized 
below. 

Consider  two  M-variate  discrete  time  stationary 
Gaussian  processes  J x k |  (k  =  1,2;  t=0,  ±1,  ±2,  .  .  .)  with 
and  matrix  covariance  functions  £k(£)  = 

Ek  [(-t+£,k  "  -k}  (-t,k  "  -k}  ]’  Let  — n,k  be 

the  block  toeplitz  matrix  defined  by 


means 


1*2 

1(1,2)  =  35  log 

1-1 
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— n,k 


Ck(0)  |  cj(l) 


Ckm  .ck(0> 


■+ 

I 

1 

I 


Ck(n-1)  I  Ck(n-2) 


I  T 

.cj(n-l) 


I  c.  (0) 

i  — k 


(4) 


where  E  ,  is  positive  definite  for  all  n.  Define  E  , 

— n ,  k  — n ,  k 

to  be  the  matrix  obtained  by  replacing  the  submatrix  elements 
Ck(&)  by  Ck(£)  =s  +  ddT  where  d  =  M2  -  M^. 

Let  Sk (A)  denote  the  M  x  M  power  spectral  matrix  of 
process  k,  given  by 

oo 

SkU)  =  ^  J  CkU)e'iXil,  -7T  <  A  <  7T  .  (5) 

1=  -oo 

Define  Sk(A)  to  be  the  spectral  matrix  of  the  process  with 
covariance  function  i.e. 

1^.(1)  =  ( A )  +  ddT  6(A)  .  (6) 
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Let  xfc  denote  either  process  1  or  2  and  let 
*1 


X  = 
— n 


x 

L-nj 

denote  n  successive  observations  of  the  process.  Define 
the  following  random  variables: 


x  i 

— n,k 


M, 

for  n=l 

— k 

E, 

[x  |  X 

k 

n 1  — n 

A 

x 

X  .  . 

— n 

— n,k 

(7) 


e  =  x.  -  x_  (8) 

x  .  is  the  filtered  estimate  under  hypothesis  k  of  x 

— n ,  js.  *  — n 

given  the  observation  Xn_i  and  £n  ^  the  corresponding 

error  process.  We  also  define  R  .  *  E.  Te  ,  e  T,  1  and 

n,k  k n , k  -n,kj 

©n  =  tr  ^  ^  respectively  as  the  n-th  step  error  covariance 
and  mean  square  error  under  hypothesis  k. 


The  (n-l)th  order  AR  representation  of  the  error 

process  is  then  defined  by 
n-1 

^jn  (-n-j  “  -kJ  = 

k=l ,  2 


(9) 


n- 1,2,3, • • 


7 


where  Ifc.  ( j=0 , 1 , . . .  ,n-l)  are  the  set  of  (n-l)th  order 

y  3  *  n  y 

partial  auto-regressive  coefficient  matrices  of  process  k 
(k) 

and  b'  =  I  without  loss  of  generality.  Define  the  M  x  Mn 

— 0  ,n 

matrix  B  .as 
$  x 


(k)  ] 
l,nj  * 


Similarly  the  (n-l)th  order  AR  approximation 
of  the  power  spectrum  is  defined  by 


£„,ka>  uo) 


where 


rn,k(x) 


n-l 

-E 

j=0 


(k)  -iA • j 
j  *  n  6 


(11) 


is  the  transfer  function  of  the  auto-regressive  filter 
defined  by  (9) .  Sn  ^  (A)  ke  s^own  to  be  the  spectrum  of 

the  "most  random"  process  whose  covariance  function  matches 
that  of  process  k  on  the  lag  interval  [-n+1,  n-l]  [ll]  . 


III.  Time  and  Frequency  Domain  Representations  for  the 
Directed  Divergence 

Let  f^lX^)  denote  the  joint  density  of  N  successive 
observations  under  hypothesis  k.  The  N-sample  directed 
divergence,  IN(1,2),  and  N-sample  divergence,  JN(1,2)  are  then 
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defined  by  analogy  with  (1)  and  (2)  as 


In(1,2) 

“  E1  [Al,2  (-N}] 

(12) 

V1'2) 

=  In(1,2)  +  In(2,1)  . 

(13) 

The  following  theorem  provides  recursive  formulas  for  the 
evaluation  of  (12)  and  (13)  . 


Theorem  1.  The  N-sample  directed  divergence  IN(1,2)  has 
the  following  auto-regressive  representations: 


1.  Time  domain 


N 

I  (1,2)  =  -  t  +  I  E  ltr  B  •>  L  -I  B  T  R  "J 
N  2  2  1 1  I  -n,2rn,l  *n,2  -n,2 

n=l 


-  log 


I5n.ll  1 
ISnf2l  J 


(14) 


2.  Frequency  domain 

v1'21  -  -  ¥ +  hf"  T,  O*’) 

-1T  n=l  L 

I  £nl(X>  II 

'  9  is“2(»ir  (15> 


Proof:  We  first  derive  (14).  Let  f,_  (x_|x_  ,)  denote  the 

conditional  density  of  xn  given  Xn-1  under  hypothesis  k. 

2 (*n)  may  be  expressed  recursively  as 
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Al,2(-n}  =  Al,2(-n-l)  +  log  [f  1  (~n  ^  ~n-l) /f  2  (?in  ^-l5] 


from  which  it  follows  that 


I  (1,2)  =  I  ,(1,2)  +  A  ,  n=l, 2 , .  .  . 
n  n— J.  n 

where  IQ=0  and 

!E1[log  ~  log  '  n=1 

E^log  f^xJXn-l5  -  log  f  2  (xn  |  x^)  ]  ,  n>1 

Substituting 

‘k'Sn'Vl1  "  <2’>'M/2lEn,kr‘5 

eXp  h  “"A  En ',1  %,k] 

into  the  above  gives 

An  =  ^l  [~n,  2  — n,  2  ~n2  "  ~n,l  ^,1  ~nl 

-  lo9  {ISnll/l£n,2l,]=  -  h  log(|Rnl|/|Rn2|) 

+  15  El[4  *n~l  — n2] 


where  the  expectation  appearing  on  the  right  of  (16)  may 
be  evaluated  as 
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T 


-1 


.  ,2  — n ,  2  ,  2  J 

r  n-1  (2)  n-1  (2)  ,x  -x 

=  En  I  E  fe-i  n  )  -  <E  £  «  Rn  , 

1  U  j = 0  0 '  i  1  j=0  J '  ,2 

£  -i"2’  -  (-?o6i.(n1)al] 


[n-1  n-1  (2) 


=  tr 


,  E  E  b. 
Li=o  j=o 


in 


+  tr 


"n-1  n»=l  ( 

E  E  b 
Li=o  i=o  1 


r  T  |  ( 2)  T  - 

,  (x  .  -M, )  (x  .-M.)  b.  R 
1L  — n-i  —1  -n-]  -1'  J-jn  — n, 

.n 


n^l  n-1  (2)  T  (2) 

j 


-  dd  b .  R 

in  —  —  jn  ■— n 


=  tr 


n-1 

n-1 

(2) 

r  Ti 

(2) 1  -1 

E 

i=0 

E 

j=0 

b. 

-in 

cx(j-i)  +  dd 

b .  R  _ 

-j,n  -n, 2 

T 

-1 

B  o 
-n,2 

Sn, 

1  -n,2 

R  ? 

— n ,  2 

The  first  part  of  the  theorem,  (14) ,  follows  from  the  latter 
result  and  (16) . 

We  now  derive  (15) .  Let  again  represent  either 
process  1  or  2  and  denote  its  mean  by  M  and  its  spectrum 
by  S(A).  Let  k  be  fixed  and  define  £(A)  =  S(A) 

+  (M-M^)  (M-Mj^) T  6(A).  Let  n  >0  be  fixed  and  consider  the 
process 


?tk'n)="to  (st-j-B-tBk-B)) 


t=0,±l,±2, 


11 


*  '*  Pair.  ■,  — 


The  spectrum  of  y^(k'n)  is  given  by 

S^'n,u>  -  r  k(X>[s(i)  +  (M-Mk)(M-Mk)T!(X)]r;  k(X) 

rnikM  S(A)  rSik(x). 

Applying  Parseval's  theorem  and  (10)  gives 

E[ytk'n)T  s„;i  ytk,n)]=tr  s„;x  E[itk,n)  2tk,n)T] 

.trfl  -1  S(k-' 

J  -n*k  -y 


(X)  dX 


-it 

IT 


kcxi  scxi  rn>k(x)dx 


=tr  f R  }  T 
I- n,k  -n, 

-TT 

-«/ Sn'^rn.k'X^n.x'*'  4a'11  l(X>)rS,k<«^ 

-TT 

TT 

[sn;k  rn  k(x)  §n ik(x)r;  k(x)J  sm  s„;J(xidx 

-TT 

y 

-j.tr_/  i(x>  sn;k' 


(X) dX . 


From  (9)  we  have,  letting  t=n: 
,,  (k ,  n)  _  n'f«1  v.  (k) 


-n 


=  5T  b.  (x  ,-M,)=e  ,  . 

jto  ~3  f n  -n-3  -K  ~n,k 
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Letting  x  =x  ,  and  k  be  alternately  1  or  2  gives 

il  II  ;  1 

El[fJ  SnI1  (17> 

~1T 

El[i„2  2^2  £„2]=itry  il(1>  2n2  <X)dX-  (18) 

-If 

Substituting  (18)  into  (16)  and  the  expression  for  1^(1, 2)  gives 

V1'2’"*?/  |=1[tr(Ii<x)  Ini1  <x>)-»' 


To  show  that 


log  R  . 

^  i  _ni 


=  hf"10*  !snltx)  |ax  , 

•' -TT 


note  that  Snl<X)  is  the  spectrum  of  the  (n-l)th  order 

AR  process  Zfc  defined  by 
n-1  /i  \ 

Y  b.  '  Z  =  l  t=0 , ±1 , ±2 ,  ... 

jto  ~3n  -t-:  -t 

where  is  a  sequence  of  identically  distributed 
uncorrelated  random  variables  with 


[it  V  -  — nl  ' 


t-0 , ±1, ±2 , . . . 


LetJImy  denote  the  covariance  matrix  of  m  successive 
observations  of  Zfc  and  let  Rmy  be  the  associated  mth  step 
error  covariance.  We  have  [l2] 


I -my I  “l5nyl/l£m-l,yl 


m=l, 2 , . . . 


However,  from  (20) , 


R  =  , 
-my  i 


R  , 
— nl 


m<n 


m>n 


It  follows  from  the  above  and  a  result  in  [13],  pg.  66  that 

i  r11' 

log|Rnil=  lim  logl£mvl=  2t r/  i°g|sYU)|ax 
m^°°  IT  4 

=  jvj  logl£ni(*) ldA 

-IT 

which  is  (19) . 

We  note  that  the  relationship  between  the  divergence 

and  linear  predictive  filtering  for  arbitrary  (not  necessarily 

stationary)  Gaussian  processes  was  first  shown  by  Schweppe 

[14].  Equation  (14)  may  be  considered  to  be  a  special  case 

of  Schweppe' s  result  when  applied  to  the  stationary  case. 

The  primary  value  of  (14)  is  that  it  enables  the 

efficient  computation  of  I  (1,2)  and  JN(1,2)  recursively 

by  application  of  the  Levinson  algorithm  which  generates 

B  ,  and  R  .  in  terms  of  B  .  . ,  R  .  .  and  C 
-n,i  -n,i  -n-1, -n-1, i  — n,i 

The  general  multivariate  version  involves  the  solution 


14 


ft'"** : 


of  both  a  forward  and  backward  estimation  problem  and  is 
described  in  fel5].  Note  that  (14)  ,  when  implemented  with 
the  Levinson  algorithm,  requires  operations  on  MxM  matrices. 
This  compares  with  the  direct  evaluation  of  (3)  which  requires 
operations  on  MN  x  MN  matrices. 

The  Asymptotic  Case 

For  large  N,  more  useful  measures  of  statistical 
separation  than  (12)  and  (13)  are  the  (asymptotic)  entropy 
rate  and  divergence  rate  [  6  -  9  J  defined  respectively  by 


R  (1,2)  =  lim  I  (1,2)/N 

1  N— 00  N 

(21) 

R  ( 1,2)  =  lim  Jm(1,2)/N. 

J  N— «oo  N 

(22) 

For  completeness  we  use  the  result  (15)  and  the  convergence 
property  of  the  AR  spectral  approximation  to  obtain  an  alterna¬ 
tive  derivation  of  a  theorem,  first  stated  (without  proof)  by 
Pinsker  [6]  .  A  more  complicated  proof  based  on  the  triangular 
factorization  of  J])"1  i-n  (3)  is  provided  in  [7]  .  For  convenience, 
we  state  and  prove  the  theorem  for  the  scalar  case.  The 
generalization  to  the  multivariate  case  is  straight-forward. 

Theorem  2.  Let  {xt  ,}  and  {xt  .,}  of  Section  II  be  scalar, 

— — — — —  W -L 

zero-mean  processes  with  respective  spectral  densities  S^(A) 
and  S2(X)  where  ( A)  is  bounded  and  S2 (A)  is  continuous  and 
strictly  positive  on  — tt <  A  <  tt  .  Then 


-  -  - 
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Rj(l,2)  =  ¥(1,2) 


where 


¥(1,2)  =  f  j^S1(X)S21(X)-l-log(si(X)S21(X))jdX.  (23) 

The  proof  of  (23)  given  below  utilizes  the  result 

that  if  {gn (x) }  is  a  sequence  of  non-negative  uniformly 

integrable  functions1  on  [a, b]  with  lim  g  (x)  =  g(x)  (a.e.), 

n— qo  n 

then 

lim  fh  g  (x)  dx  =  J  b  g(x)  dx  (24) 

n—oo-'  a 


Proof.  From  (15)  we  have 


(1'2)  =  [SlU>Sn2(X,-1-los(Snl(X,Sn2<X>)]dX- 


For  the  logarithmic  component  of  the  above  expression  we  have 

lim  -£=■  Z*77  log  S  .  (X)  dX  =  h  lim  log  R  . 

— J  ni  _  m 


n-«oo  —  7T 


n-*oo 


i  i i  I  ^ni  I  it.  i  IS  ni 
h  lim  log  - -  =  *5  lim  log 


— *  iV  i 

n-»oo  l^n-l,i  | 

1  C 77 

JZ  J  log  S.(X)dX 

-IT 


n— oo 


n 


1  A  sequence  of  non-negative  functions  (gn(x))  is  said  to 
be  uniformly  integrable  on  J^b]  if  for  e>0  ,  there  exists 
a  5>0  such  that  for  all  n,/A  gn(x)dx<e  for  any  set 
AC[a,b]  with  measure  m(A)<5  . 
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where  to  obtain  the  last  result  we  have  again  utilized 
the  theorem  on  p.  66  of  [l3]  . 

It  remains  to  be  shown  that 


lim  f  S^Ajs'jUJdX  =  f  S1(X)S~1(X)dX 
n— ao  -it  —it 


We  note  that 

lim  S- (X) S~i (X)  =  1  for  all  X 
_  i  nz 

n— *oo 

and 

h  f"  S2<X)Sn2(X)  '  1  for  a11  n 

-TV 

where  (26)  follows  from  the  convergence  property  of  the  AR 
spectral  approximation  and  (27)  is  identical  with  (17). 
Equations  (26)  and  (27)  imply  that  the  sequence  gn(X) 

=  S2(X)/Sn2(X)  is  uniformly  integrable  on  [-it  ,  ir].  To  see 
this,  let  6  be  an  arbitrarity  small  positive  number  and  let 

En  =  |A:!gnU)  • 

It  follows  from  (2  6)  that  m(E  )  =  2ir-a  where  lim  a  =0. 

n  n  n 

n—oo 

For  any  set  AC[o,2tt]  we  then  have 


(25) 


(26) 


(27) 
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fh  g  (X)dX  <fknE  g  Max  +f  gn<X)dX 

n  En 

gn(X)dX 
n 

<  (l+3)m(A)  +  2tt  -  (1-0)  (2ir-a  ) 

n 

<  (l+8)m(A)  +  2tt 3  +  a  . 

n 

Now,  let  the  e  of  the  uniform  integrability  definition  be 

given.  It  follows  from  the  last  inequality  that  there  is  an 

N  such  that  for  n>N  ,  fA gn(X)dX<e  if  m(A)<6*  where 

6*  =  (e-2ir0-aN) /(1+3) .  For  each  n<N  ,  there  will  exist  a 

6n  such  that  JA gn(A)dA<e  for  all  A  with  m(A)<6n  (  [l6]  ,  p.  85). 

Choose  6  in  the  uniform  integrability  definition  to  be 

6  =  min  (5*,  <5^,  .  .,  <5^  ^}. 

The  uniform  integrability  of  f gn ( X ) }  implies  the 
uniform  integrability  of  S^XJS’^X)  since  by  the  assumptions 
of  Theorem  2  there  exists  a  C<«  such  that  S.^  (X)/S2  (X)<C 
for  -7r<X<Tr.  Then 

fh  Sl(X)Sn2  (X)  dX  =  fA  S^^S^fX)  gn(X)dX 

£  0  /A  9„<x)dx  . 

(25)  then  follows.  This  completes  the  proof  of  Theorem  2. 


=  JmEn  9n  (X)  dX  gn(X)dX  -jf 
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The  following  corollary  is  an  extension  of  (25)  and  is 
used  in  the  treatment  of  the  asymptotic  masking  problem  in  Sec¬ 
tion  V. 

Corollary.  Let  S^A)  be  bounded  and  let  S2(X)  be  of  the  form 

a,  2 

S-U)  =  s-  (A)  +  £  d.  <S  ( A-A  . ) 

2  2 ,  c  -Ti  i  i 

where  S2  c  is  continuous  and  strictly  positive  on[0,2ir]  and  the 

sum  on  the  right  corresponds  to  a  finite  number  of  discrete 

2 

contributions  to  the  spectrum  of  power  d^  at  the  frequency 
A^  (i=l, 2, . . q<ao  )  .  Then 


-l  r  ^  -l 

lim  /  S. (A)  S  (A) dA  =  /  S. (A)  S_  ( A)  dA  . 
n— *00  —  7T  •'—IT  ZfC 


(28) 


Proof. 


„  =  S  S  ~.  Now 
n,c  2,c  n2 


We  have  S.  S  =  S.  S~  g  where 
1  n2  1  2,c  ^n,c 

-1 


lim  g„  „  =  1  (a. e. )  , 

n-oo  n»c 


and 


lim  f  g  dA  =  lim  fs.,-]£d.2  6  ( A  -  A  .  )1  S  "1 
n—  oo  *'_tt  n* c  n  -*oo  J-v  I  2  x  i  I  n2 

=  2tt  lim  1-  ^  d.2  S  ~^(A.)  =  2tt 

n  ->  oo  I  i~|_  1  n2  i  I 


dA 


The  identical  argument  used  for  the  proof  of  (25)  may  now  be  re¬ 
peated  to  show  that  {gnc),  and  thus  (Sx  Sn2>,  are  uniformly  inte 
grable.  It  then  follows  that 
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IV. 


The  Minimal  Divergence  Masker 

In  the  following,  we  apply  the  results  of  Section  III 
to  the  MDM  solution  of  the  Gaussian  masking  problem  outlined 
in  the  Introduction.  Specifically,  let  the  signal  process 
s(t)  be  a  given  scalar  stationary  Gaussian  process  with  mean 
u  and  covariance  function  C  (Jl)  .  We  wish  to  hide  the  possible 
presence  of  s(t)  by  imbedding  it  in  another  stationary 
Gaussian  process  m(t)  (the  "masker  process")  which  is  indepen¬ 
dent  of  s ( t)  and  has  a  fixed  mean  u  and  fixed  variance  V  . 

m  m 

The  optimal  solution  of  the  masking  problem  is  provided  by 

that  masker  covariance  function  C  (£)  on  the  lag  interval 

m 

l  =  1,2,  . . . ,N-1  which  maximizes  the  probability  of  error 
Pe  in  the  binary  hypotheses  test 

H^:  x(t)  =  m(t)  (signal  not  present) 

H2:  x(t)  =  m(t)+s(t)  (signal  present) 

t=l ,  2 , . . , N 

where  (x(t);  t=l,2,...,N}  represents  N  observations  from  one 
of  the  two  hypotheses  and  both  hypotheses  are  considered 
equally  likely. 

Letting  =  (x^  x2,  ...  ,  xN)T,  M^  and  M^s  denote 
N-vectors  all  of  whose  components  are  and  yg  respectively, 
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-  ■■***? 


and  A  (XN)  =  log  P2  ^1  denote  the  log-likelihood  ratio, 


we  have 


Pe  -  H/.»A£2(A,dA+A0 


where  f^(A)  is  the  density  of  A  under  hypotheses  i  and  A  is 
given  by  the  quadratic  form 

A<V  ’  *  109  jfel  '  ^  “SnM^WVs'] 

[~N  ^N,m+^N,s^]  +  15  tr  ?N,  1 

where  EN#1  =  £N/In  ,  £N/2  =  SN,m  +  5NfS  and  and 

gN>s  are  the  analogues  of  (4). 

The  complicated  nature  of  the  densities  f^A)  make 
the  minimization  of  P  with  respect  to  C  U)  (4=1, 2 , . . ,N-1) 
numerically  difficult.  A  sub-optimal  solution  to  this  problem 
may  be  obtained  by  determining  that  masking  covariance  which 
minimizes  the  divergence  JN(1,2)  =  E^AJ-E^A)  between  the 
masking  and  masking  plus  signal  processes.  This  minimization 
—  involving  only  first  moments  of  A  —  is  considerably 
simpler  than  that  required  for  (29).  However,  the  possibly 
complex  nature  of  the  (N-l)  dimensional  surface  of  JN(1,2) 
(considered  as  a  function  of  cM(l) ,  CM(2) ,  ...  ,  C  (N-l)) 
may  still  make  the  minimization  difficult  -  particularly  for 
large  N.  Fortunately,  the  latter  issue  is  of  little  concern 
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as  the  function  has  only  one  local  minimum  (which  is  the  unique 
global  minimum) .  This  is  shown  in  the  following  theorem 
which  is  valid  for  arbitrary  Gaussian  vectors. 


Theorem  3.  Let  Y  (the  masker)  and  Y„  (the  signal)  be 
-  —m  — s 

two  independent  N-dimensional  Gaussian  vectors  with 

means  a  and  a  and  positive  definite  covariance 
-m  -s 

matrices  K  and  K  where  a  ,  a  and  K  are  fixed  and  K  is 

—m  — s  —  m  ~  s  — s  — m 

allowed  to  vary.  Let  Y1  =  Y  and  Y_  =  Y  +Y  .  Then  the 

— 1  — m  — 2  — m  s 

divergence  J(l,2)  4  J (Km)  between  Y^  and  Y 2  is  a  strictly 

convex  function  of  K  ,  i.e.  for  O<0<1  and  arbitrary  covariances 

-m 


— m' 


K/ 

— m 


with  K  /  K': 
— m  —m 


J(0K  +  ( 1 - 0 ) K' ) <  0  J ( K  )  +  (1-0)J(K:) 
— m  — m  -m  — m 


(30) 


Proof:  The  proof  of  (30)  involves  the  simultaneous  diagonali- 

zation  of  a  judicious  choice  of  two  matrices  associated  with 

K  ,  K'  and  K  .  Let  K  and  K'  be  fixed  and  consider  the  func- 

-m  — m  — s  — m  —  m 

tion 

D  ( 0 )  =  J  ( 0K  +  (1-0)  K')  ,  O<0<1. 

“to  ~m 

2  2 

Our  approach  will  be  to  show  that  d  D(8)/d0  >0  for  all 


0  6(0,1).  We  have  from  (2)  and  (3)  that 


Thus 


where 


J (K  )  =  h  a  T  F(K  +Ke) -1  +  K'1  1  a 
-m  s  ^  — m  s  in  J  s 

+  h  tr  [ (K  +K  ) _1  K  +K_1  (K  +K  )-2ll 
[_  -m  -s  — m  -m  — m  -s  -J 

=  35  aT  T  (K  +K  )  ~1  +  K-1  1  a 
— s  l  -m  — s  -m  J  — s 

+  h  tr  [k~1K  -  (K+K) -1  K  I  . 

L  m  "s  “in  ~s  — s  J 


D  ( 0 )  =  D^G)  +  D2(0) 


D1(0)  =  *5  01^  [(H(0)+Ks)_1  +  H~1(e)ja^ 
D2(0)  =  h  tr  [h-1(0)Ks  -  (H(e)+Ks)_1ISs] 


H  ( 0 )  =  0K  +  (1-0)K" 
—  — m  —m 


Let  0q  with  O<0q<1  be  fixed  and  apply  a  non-singular 
linear  transformation  T  =  T.(0O)  suc^  that 


T  (h(0q))  TT  =  A 


T  KgT  =  I 


where  A  is  a  diagonal  matrix.  For  arbitrary  9,  let  D(0) , 

D x  ( 0 ) ,  D2<0) ,  H(9) ,  ag  and  Kg  denote  D ( 0 ) ,  D1(0) ,  D2 (0) ,  H(0) , 
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p»r-~ 


a  and  K  respectively,  under  the  transformation  T(0_). 

S  *“S  u 

From  the  invariance  of  the  divergence  to  non- singular  linear 
transformations  it  follows  that 
D(Q)  =  D(8)  ,  0<  0<  1 

and  in  particular  that 

d2  D/d02  =  d2(D1  +  D2)/d02  =  d2D/d02  (33) 

o<  e<  i 

2  —  2 

Evaluating  d  D-^/de  by  repeated  application  of  the  inverse 

matrix  differentiation  rule  gives 

2  —  2  T  —  —  -1  T _ 1 

d  D,/d0  =  V  (H+K  )  V  +  u  H  U 

1  ^  —  ™  S  “  “■  ■"  “ 

where 

V  =  (Km-K') (H+K)'1  a 

U  =  (K  -K')  H_1  a  . 

-  -m  — m  -  — s 

Since  H  and  H+Kg  are  symmetric  and  positive  definite,  it 
follows  that  H  _1  and  (H+K  )  1  are  symmetric  and  positive 

mm  mm 

definite.  Thus 

d2  D1/d02>O  ,  O<0<1  .  (34) 

Similarly,  differentiation  of  gives 

d2  D-/d02  =  tr{(K  -KJ2  fTh  ”3  -  (H+K  J"3 

2  |  —  m  — m  ~s  —  — s 
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Evaluating  the  latter  expression  at  0Q  and  using 
(31)  and  (32)  gives 

a2  Vde2le=e0  -  tr  B. 

-3  -  3 

where  R  =  A  -  (I+A)  is  a  diagonal,  positive  definite 
_____  2 

matrix  and  (K  -K')  is  non-negative  definite.  We  now  show 
-m  in 

that 

a2  Vde2U=e0>0  • 

Evaluating  (35)  gives  N 

a2Vde2le-e  «u'>u 

0  i=l 

—  —  2 

where  6. .  and  v. .  are  the  i-th  diagonal  elements  of  (K  -K') 
li  n  — m  — m 

and  R  respectively. 

_  _  2  _  _ 

(k.  -k.'  )  >0  where  k.  _  and  k/  are 

j-o.  e  le  le  -  i,e  le 

the  elements  of  Km,  respectively,  and  by  hypotheses 

K  *Y.',  it  follows  that  at  least  one  of  the  3-  .  >0  and  (36) 
— m  -m  n 

follows. 

We  conclude  from  (33) ,  (34)  and  (36)  that 
d2  D/de2| 0=e^>  o  . 

Since  6g  is  arbitrary,  (37)  is  true  for  all  0  with  0<6<1. 
Now  consider  the  function  g(9)  =  D(0) -0D(1) -(1-0)D(O) . 


(35) 


(36) 


(37) 
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We  have  g"(0)>O  for  O<0<1,  g  ( 0 )  continuous  for  O<0<1,  and 
g(0)  =  g(l)  =0.  It  follows  from  these  properties  that 
g(0)<O  for  O<0<1,  or,  equivalently, 

D ( 0 )  <  0D (0)  +  (l-e)D(l) 

which  is  (30).  This  completes  the  proof  of  Theorem  3. 

We  now  return  to  the  MDM  problem  as  formulated  for 
the  stationary  case. 

We  note  that  the  divergence,  JN(1,2)  between  the 
m(t)  and  m(t)  +  s(t)  processes  may  be  expressed  using  (2) 
and  (14)  as 


JNd,2) 


=  -N  + 


h 


V" 


y  B  * 

n;ra+s  — nm  -n;m+s 


e 


-1 

n;m+s 


B 

-n;  m 


(5 


nm 


t  )  B'  e-1  " 
—ns  /  — n;m  n;m  _ 


where:  and  Sns  are  °^ta^nec^  by  adding  to  each 

element  of  _ and  respectively;  B  and  e  are  the 

-  nm  —ns  c  2  -n; m  n;m 

(n-l)th  order  AR  parameters  associated  with  m(t) ;  and  B  „ , 

— n; m+s 

and  en.m+s  are  the  (n-1) th  order  AR  parameters  associated 
with  m(t)  +  s  ( t)  . 

From  the  Yule  Walker  equations  [l7] 


B 

-n;m 


e 

n;m 
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and  the  above  expression  simplifies  to  the  following  symmetric 


form 


(38) 


The  MDM  process  is  then  specified  by  the  N-l 

A  A  A 

quantities  C  (1),  C  (2),  .  .  C  (N-l)  which  minimize  (38). 

If  this  minimization  is  performed  directly  on  the  covariances 
{ Cm ( fi. ) }  it  is  a  problem  of  constrained  optimization  in  that 
the  solution  must  satisfy  the  positive  definiteness  constraint 


N  N 

E  E 


i=l  j  =  l 


Vj  >0 


(39) 


for  any  set  of  numbers  (&^,  &2'  •  •  •  t  ^  N)  not  all  0. 
Fortunately  there  exists  an  alternative  parameterization  of 
m(t)  on  the  interval  t=l,2,...,N  in  terms  of  which  the  minimi¬ 
zation  of  (38)  is  a  problem  of  unconstrained  minimization. 

This  is  the  parameterization  in  terms  of  (p.  .  p«  ,  , 

l ,m  z ,m 

p„  .  )  where  p  .  is  the  j-th  partial  correlation  coefficient 

N-l , m  j ,m  K - 

of  m(t)  .  The  major  advantage  to  the  latter  parameterization 
is  that,  in  contrast  to  (39),  the  stationarity  constraint 
on  the  p .  is  considerably  simpler  and  is  given  by  [17] 


t  P  j ,  in  I  <1  j  —  1#2,  N-l  .  (40) 

(38)  may  be  computed  recursively  in  terms  of  the  {p.  } 

3  »m 

using  the  Levinson  recursion.  It  may  also  be  shown  that 

A  A 

the  solution  (p,  . ..,  p.T  ,  )  which  minimizes  (38)  in  the 

i,m,  N-i,m 

closed  (N-l) -dimensional  cube  |p^  ...  |  <  1  (n=l, 2 , . . ,N~1)  is 

j  /  rn 

in  the  interior  of  the  cube.  Since  the  mapping  from 

(c  (1),  C  (N-l))  to  (p.  m  . ..,  pM  .  )  is  one-to-one 

\  m  m  /  l,m,  N-l,m 

and  continuous,  it  follows  from  Theorem  3  that  (38)  considered 
as  a  function  of  pn-1  m^  ^as  a  un^-<5ue  local  minimum 

(which  is  the  global  minimum)  in  the  region  specified  by  (40) . 
This  minimum  may  be  obtained  by  standard  numerical  techniques 
for  unconstrained  minimization. 

Numerical  Examples. 

We  illustrate  the  MDM  solutions  corresponding  to  three 
different  signal  processes,  s(t).  These  are: 

(a)  s(t)  =  s^(t)  where  s^(t)  is  a  first  order  Gauss-Markov 
process  with  mean  u  and  unit  variance,  satisfying  the  difference 
equation 

s^t)  =  [s^t-D-Pg  J  +  e1(t)  (41) 

t=0 , ±1 , ±2 ,  .. 
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where  e^(t)  is  a  zero-mean  white  Gaussian  sequence. 

(b)  s(t)  =  s2(t)  where 

s2(t)  =  s^t)  +  Acos  (2irft)  +  Bsin  (2irft)  (42) 

t=0 , ±1 ,±2 ,  .  . 

where  f=*s  and  A  and  B  are  zero-mean  uncorrelated  Gaussian 
random  variables  with  Var  A  =  Var  B  =  1/8. 

(c)  s(t)  =  s^(t)  where  s3(t)  is  a  Gaussian  second-order 
auto-regressive  process  with  mean  yg  and  unit  variance 
satisfying  the  difference  equation 

s  3  ( t)  =  Ms  +  3/4[s3(t-l)-ysj-Js[s3(t-2)-us]+  e3(t)  (43) 

t=0 , ±1 , ±2 ,  ... 


where  e3(t)  is  a  zero-mean  white  Gaussian  sequence. 

In  the  following,  we  illustrate  the  corresponding 
MDM  solutions  for  various  values  of  yg,  masking  variance 
(Vm) ,  and  sample  size  (N) .  Rather  than  give  these  solutions 

A 

in  the  time  domain  (for  instance  by  giving  { Cm ( X- )  ,  $,=1, 2 , .  .N-l}  , 
it  is  generally  more  informative  to  provide  them  in  the 
frequency  domain  in  terms  of  the  corresponding  (N-l)th  order 
AR  spectral  approximations 


SN 

N,m 


_1_ 

2tr 


(44) 
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A  <  V 

where  e„  (b;  A;  j«0, 1, • • • ,N-1}  are  the  AR  parameters 

N , m  ana  j , N 

A  A 

corresponding  to  { Cm ( 0 )  =  Vm,  Cm(l) ,  . ..,  C^fN-l)}. 

Figures  1-3  show  the  MDM  solutions  corresponding  to 

the  three  illustrative  signal  processes  assuming  yg  =  0,  N=20 

and  V  =  10  (Figs.  1-2)  or  V  =3  (Fig.  3  )  (the  ripples  in 
m  m 

Fig.  2  are  caused  by  ringing  inherent  to  the  AR  spectral 

approximation) .  We  note  that  the  MDM  spectra  are  not  identical 

to  the  signal  spectra  and  involve  a  shift  of  spectral  energy 

relative  to  the  signal  from  high  to  low  energy  regions.  We 

refer  to  this  as  a  "partial  whitening"  of  the  MDM.  Intuitively, 

such  an  energy  allocation  is  plausible  since  it  allows  the 

masker  to  dominate  the  signal  in  frequency  regions  where  the 

signal  is  low  with  little  decrease  in  the  masker/signal  ratio 

in  regions  where  the  signal  is  high. 

Figure  4  shows  the  effect  on  the  MDM  of  introducing 

a  non-zero  signal  mean  y  .  As  y  increases,  more  masking 

s  s 

energy  is  concentrated  in  an  impulsive  manner  in  the  dc 

region  of  the  spectrum.  This  impulse  corresponds  in  the  time 

domain  to  the  mean  of  X  (t)  being  a  random  variable  (with  a 

m 

variance  given  by  the  strength  of  the  impluse) .  This  result 
has  an  obvious  intuitive  explanation:  if,  for  instance,  yg>0 
and  the  N-sample  observation  has  a  high  sample  mean  (e.g.,  >  ym) 
the  observer  must  decide  whether  this  is  due  to  the  presence 


Power  (dB/Hz) 


Frequency  (Hz) 


Fig.  2.  Masker  spec trim  corresponding  to  zero-mean  signal 
process  defined  by  equation  (42) . 


Power 


Fig.  3.  Masker  spectrum  corresponding  to  zero-mean  signal 
process  defined  by  equation  (43) . 
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Power  (dB/Hz) 


of  the  signal  or  to  the  absence  of  the  signal  and  the  circum¬ 
stance  that  the  (random)  mean  of  X  (t)  for  this  sample  was 

m 

particularly  high;  conversely,  if  the  sample  mean  is  low, 
this  may  be  due  to  the  absence  of  the  signal  or  to  the 
presence  of  the  signal  and  a  mean  of  Xm(t)  which  is  particularly 
low. 

Figures  5  and  6  show  the  effect  of  increasing  N  on 

the  MDM  for  s(t)  =  s^(t)  and  yg  =  *5.  Note  (Fig.  5)  that 

the  energy  in  the  DC  region  decreases  as  N  goes  from  10  to  40. 

Figure  6  shows  the  optimal  masking  auto-correlation  f ( Jl )  ;  £=0,1,. 

N-l}  for  N  =  5,  10,  20,  and  40.  These  results  suggest  that 

asymptotically  (N-*oo),  the  MDM  solution  for  y  *  0  converges 

s 

to  that  for  yg  =  0.  That  this  is  indeed  the  case  in  general 
(with  a  similar  result  valid  for  the  discrete  component  of 
the  signal  spectrum) ,  is  shown  in  the  next  section,  which 
treats  the  asymptotic  masking  problem. 

V.  The  Asymptotic  Masking  Problem 

Asymptotically  (N-»oo)  the  MDM  problem  becomes  one  of 
determining  the  m(t)  which  minimizes  the  divergence  rate 
R j (1,2)  between  the  m(t)  and  m(t)  +  s(t)  processes.  We  refer 
to  the  masking  process  which  minimizes  Rj(l,2)  as  the  asymptotic 
minimal  divergence  masker  ( AMDM) .  Our  starting  point  for  the 
analysis  of  the  asymptotic  case  is  the  frequency  domain  analog 
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Frequency  (Hz) 


Fig.  5.  Masker  spectra  corresponding  to  signal  process  defined 

by  equation  (41)  with  y  =0.5  and  N  =  10,  20,  40. 

s 
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of  (38)  which  is  obtained  using  (15)  and  (17)  as 


Js(l,2) 


*  T7  E  / 


n=l  -tt  n;m+s 


n;m+s 


where:  S  and  S  are  the  spectra  of  X  and  X  ,  S  .  =  S  +S  , 

m  s  m  s  m+s  m  s 

S  and  S  .  are  the  (n-1) th  order  AR  approximations  to 
n;m  n;m+s  csr 

Sm  and  S  .  ,  and  S  =  S  +  y^6(X),  Sra,e  =  S  +  S  +  (X) . 

m  m+s  m  m  s  m+s  mss 

Substituting  the  expressions  for  Sm  and  ^m+s  into  (45)  gives 


.  N  (  ,2  w2 

JN<1,21  =  47  E  IS  (0 )  +  S  (o: 

n_^ l  n;m  n;m+s 

♦£  [A  -  s^T-M 


Consider  the  case  where  ygie  0  and  the  signal  spectrum 


is  of  the  form 


SsM  -  Ss,oa)  +E  ai  «<»-»!> 


where  S  (X)  denotes  the  portion  of  the  spectrum  continuous  on 
S  /  c 

[0,2tt]  and  the  sum  in  (47)  corresponds  to  a  finite  number  of 

2 

discrete  contributions  to  the  spectrum  of  power  d^  at  the 
frequency  X^^  (i=l,2, . .  ,q<oo) .  (46)  then  becomes 
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This  is  the  general  frequency  domain  expression  for  JN(1,2)  in 
the  context  of  the  masking  problem.  The  MDM  masking  solutions 
illustrated  in  Figs.  1-6  are  best  understood  by  inspection  of 
(48)  . 

The  following  two  theorems  characterize  the  AMDM  solu¬ 
tion  for  processes  of  the  form  (47)  and  provide  a  constructive 
method  for  its  generation  numerically. 


Theorem  4.  Let  the  signal  process  be  zero-mean  and  have  a 

continuous  spectral  density  S  (A)  =  S  (A)  >0.  Then  in  the 

S  S  f  c 

class  M  of  continuous  positive  spectral  densities  with  variance 

V  ,  there  is  a  unique  corresponding  AMDM  process.  Its  spectrum 

£  and  divergence  rate  R_(l,2)  satisfy 
m  J 


:i-2»  -  hf] 
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Sm+S- 
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S  e  M  4tt  j 
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S 

in 

S  +S 
m  s 

Sm  may  be  obtained  to  any  desired  approximation  by  a  straight¬ 
forward  numerical  procedure. 
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Proof. 


That  the  AMDM  solution  -  if  it  exists  -  satisfies 


(49)  follows  from  the  result  that  for  any  S  e  M 

m 


•  —  f" 

4tt  J -7r 


s  s 
n;m  n;m+s 


dX 


S  +S 
m  s 


dX 


(50) 


which  follows  from  (48)  and  (25) .  We  now  show  that  there  is 

a  unique  £  e  M  by  the  explicit  minimization  of  (50) . 

To  minimize  (50) ,  we  introduce  the  Lagrange  multiplier 

£  and  the  functional  (of  S  and  £) 

m 


r 


m 


Sm+Ss 


dX 


■  iC 


S  dX  -  V 
m  m 


(51) 


The  critical  points  of  (51)  satisfy 


r 

-TT 


S  dX  =  V 
m  m 


(52) 


and  also  satisfy  for  each  X: 

[ 


S  I (S  +S  ) ~2  -  s'2  =  £ 

~  ”  m  . 


s  m 


or 


-£-1  (S3  +  2  S  S2)  =  S4  +  2  S3  S  +  S2  S2 
s  ms  m  ms  ms 


(53) 


£  must  be  <0  since  S  and  S  are  >0  .  We  now  show  that  for 

s  m 

fixed  £  <0  and  fixed  X  e  ,  there  is  a  unique  positive 


solution  S  .(X)  of  (53). 
m;  £ 
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Consider  the  graphs  of 

ht(y)  -  -»'x  s2  -  2  r1  sj  y 


g(y)  =  y4  +  2  Ss  y3  +  sj  y2 


for  y>0.  Since  Sg>0  ,  the  slope  of  h^  is  a  positive  constant 

and  h^(0)>0.  Now  g(0)  =  0<h^(0).  Let  be  the  smallest 

positive  abscissa  where  g  and  h^  intersect.  Clearly,  there 

are  intersections  since  g  grows  faster  than  h^.  Now  g'(y)  is 

an  increasing  function  and  g'(5^)  is  necessarily  greater  than 

h^(5^)  (=  a  constant).  Since  g'>h^  for  y>y^,  g>h^  for 

y  >5^  .  Hence,  for  fixed  l,  g  and  h^  intersect  at  exactly  one 

point  in  the  right  half  plane,  and  there  is  a  unique  S  <0  (X)  =  y0 

nij  jc 

which  satisfies  (53)  for  each  X.  Moreover,  it  follows  from 

the  assumed  continuity  of  S  (X)  in  (53)  that  S  „  (X)  is  a 

s  m;  Jo 

continuous  function  of  X. 

In  the  above  manner  we  have  constructed  by  standard 
algebraic  methods  the  family  of  continuous  spectral  densities 

parameterized  by  l,  which  satisfy  the  constraint  (53). 

m,  x, 

We  now  determine  the  unique  &=1t  such  that  S  -**=s  also  satisfies 
the  constraint  (52).  Note  that  for  y^  >  y^*  Hence 

S  .  (A)  >sm.o  (x)  for  a11  A'  and 


P  S  .  dX  >  p  S  c 

m;*2  J-*  mfAl 
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Thus  the  £  for  which  Sm>£  satisfies  (52)  is  unique,  “t  may  be 
found  by  a  straight-forward  binary  search:  start  with  £^  and 
£  2  >  £■  i  such  that 


dX  >  V  , 
m 


Determine  £_  and  £.  such  that 
3  4 


dX  < 


V  ,  / 
m  •'-it 


and  l ^  -  £^  =  *3 ( S'2— ^l)  '  *  *  *  an^  so  on*  T^e  sequence  of  inter¬ 
vals  [^2n+l'  ^2n+2-J  conver9es  to  The  search  process  may  be 

terminated  when  an  £  is  obtained  for  which  S  -^dX^V  to 

J-i r  m;  £  m 

any  desired  degree  of  approximation.  is  then  the  desired 

IH7  X/ 

approximation  to  s‘m-  This  completes  the  proof  of  Theorem  4. 


Rj(l,2)  is  a  monotonically  decreasing  function  of 


Proof.  Let  p_(X)  =  S  (X)/V  and  p  (X)  =  S  (X)/V  denote  the 
-  s  s  s  m  m  m 

normalized  signal  and  masker  spectral  densities.  We  have 


42 


r 


tWiWwSJtf..-.- - 


Since  the  last  expression  is  a  monotonically  decreasing  function 
of  y»  the  corollary  then  follows  from  (49). 

Theorem  5.  Let  the  signal  process  have  mean  0  and  a 
spectrum 

S  (X)  =  Sc  (X)  +  V  d?  6  (X-X . )  (54) 

i=l 

as  in  (47) .  Then  the  corresponding  AMDM  process  is  in  the  class 
M  and  is  the  AMDM  process  corresponding  to  the  zero-mean  signal 
process  with  spectrum  S  (X)  (i.e.  the  mean  of  the  signal  and 

S  f  c 

the  discrete  component  of  its  spectrum  do  not  effect  its  AMDM) . 


Proof.  Consider  masker  processes  of  the  form 

VX)  “  Sm,c(X>  +  SM  ei  S<x-V  <55> 

i=l 

where  c  is  the  continuous  spectral  component,  eQS(X)  is  a 

discrete  component  corresponding  to  y  *  0  ,  and  the  sum  on  the 

s 

right  represents  discrete  components  corresponding  to  the  dis- 

q 

Crete  components  ofS.  LetV  ,  =  Vs  e?  and  V  =  V  -V  . 

s  m,d  i  m,c  m  m,  d 

i=0 

be  the  partioning  of  the  masker  variance  between  the  discrete 

and  continuous  portions  of  the  spectrum.  Assuming  that  each 
2 

of  the  are  >0,  we  have  for  the  corresponding  AR  approxima¬ 
tions 
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lim  S„  m(0)  =  lim  t  (X.) 

n->  oo  n,m  n  — oo  n,m  1 


=  lim  S  ,  (X . )  =  oo 
n-»oo  n'm+s  1 


(56) 


i=l* 2 , . . , q 


From  (48)  ,  (56)  and  (28)  we  have 


R  (1,2)  =  lim  J„(l,2)  -  , (1,2) 

J  N— »oo 


1_ 

4tt 


lim 
n  -»oo 


s ,  c 
n,m 


S 

s,  c 
c 

n, m+s  . 


dX 


_1_ 

4rr 


S 

s,c 

S  +S 
m, c  s , c 


dX 


(57) 


2 

For  fixed  V  =  V  -  V*  £ .  ,  the  AMDM  masker  is  then 
m,  c  m  t—i  i 

i=  0 

fully  characterized  by  Theorem  4  and  is  obtained  by  minimizing 

(57)  with  respect  to  the  continuous  component  Sm  c  under  the 

constraint  |  F  S„  „  =  V  .  Denote  the  corresponding  minimal 
y_Tf  m,c  m,c  r  3 

divergence  rate  by  Rj(l,2;Vm  c) .  However,  by  the  corollary  to 

Theorem  4,  Rj(l,2;Vm  c)  is  a  monotonically  decreasing  function 

of  V  .  Thus  for  the  AMDM  solution,  V  =  V  ,  V  ,  =  0  and 
m,c  m,c  m  m,d 

the  theorem  follows. 


Remark  -  The  fact  that  the  AMDM  masker  spectrum  has  no  dis¬ 
crete  components  even  if  such  components  are  present  in  the 
signal,  is  best  motivated  by  the  following  observation  [6] : 
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i=},2 


consider  two  Gaussian  processes  with  spectra 
S.  (A)  =  S.  (A)  +  V  a?  .  6  (A-A . ) 

f  i  9  J  J 

2 

where  the  a  .  .>0  for  i=l,  2 ;  j  =  l,  2 , . .  .  , q <  oo.  Then  the  asymptotic 

1 1  J 

(N— »oo)  probability  of  error  for  distinguishing  between  the  two  pro¬ 
cesses  is  >0  if  and  only  if  S.  (A)  =  S,  C A )  (a.e.).  The  fact  that 

asymptotically  non-singular  detection  depends  on  the  equality 
of  S^c  and  S2c  but  not  on  the  equality  of  the  {a^}  and  {a2j}, 
suggests  that  the  AMDM  place  asymptotically  vanishing  energy 
in  the  discrete  spectral  components  so  as  to  maximize  the 
similarity  of  the  masker  and  masker  plus  signal  continuous 
spectral  components,  as  is  indicated  by  Theorem  5. 
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